Automatic Speech Recognition and Pronunciation Error Detection of Dutch Non-native Speech: cumulating speech resources in a pluricentric language

نویسندگان

چکیده

• Improving the performance of Automatic Speech Recognition (ASR) on learner's speech in a non-dominant variety pluricentric language by cumulating resources from different varieties same language. Through transfer learning approach, knowledge dominant can be transferred to and this benefits non-native recognition. Introducing plausible pronunciation errors native corpus based evaluate Pronunciation Error Detection (PED) algorithms. The shortage large-scale learners’ corpora precise manual annotations are two major challenges for automatic L2 recognition error detection speech, especially languages. In these cases, collecting annotating large (L2 learner) all is often unattainable. study, we investigated ways addressing problems through conventional Deep Neural Network (DNN) ASR-based Netherlandic Dutch Flemish resources. First, show that ASR baseline system improved combining datasets. Next, learned models trained data, learners' model further improved. order PED algorithms absence learner data with annotations, introduced simulate errors. For found results much better GOP classifier than one data. produced worse when were merged while ASR, lower WERs attained. Whether adding beneficial, thus seems depend specific task used for. We discuss results, compare them those related research suggest avenues future research.

برای دانلود باید عضویت طلایی داشته باشید

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Automatic pronunciation error detection in non-native speech: the case of vowel errors in Dutch.

This research is aimed at analyzing and improving automatic pronunciation error detection in a second language. Dutch vowels spoken by adult non-native learners of Dutch are used as a test case. A first study on Dutch pronunciation by L2 learners with different L1s revealed that vowel pronunciation errors are relatively frequent and often concern subtle acoustic differences between the realizat...

متن کامل

Non-native Pronunciation Variation Modeling for Automatic Speech Recognition

Communication using speech is inherently natural, with this ability of communication unconsciously acquired in a step-by-step manner throughout life. In order to explore the benefits of speech communication in devices, there have been many research works performed over the past several decades. As a result, automatic speech recognition (ASR) systems have been deployed in a range of applications...

متن کامل

Lexical modeling of non-native speech for automatic speech recognition

This paper examines the recognition of non-native speech in jupiter, a speaker-independent, spontaneous-speech conversational system. Because the non-native speech in this domain is limited and varied, speakerand accent-specific methods are impractical. We therefore chose to model all of the non-native data with a single model. In particular, this paper describes an attempt to better model non-...

متن کامل

Automatic Evaluation of Dutch Pronunciation by Using Speech Recognition Technology

The ultimate aim of the research reported on in this paper is to de­ velop a system for automatic assessment of foreign speakers’ pronuncia­ tion of Dutch. The aim of the experiment described here was to deter­ mine whether pronunciation ratings assigned by human experts could be predicted on the basis of scores calculated by an automatic speech recog­ nizer. To this end 20 native and 60 non-na...

متن کامل

Automatic Evaluation of Dutch Pronunciation by Using Speech Recognition Technology

The ultimate aim of the research reported on in this paper is to develop a system for automatic assessment of foreign speakers' pronunciation of Dutch. The aim of the experiment described here was to determine whether pronunciation ratings assigned by human experts could be predicted on the basis of scores calculated by an automatic speech recog-nizer. To this end 20 native and 60 non-native sp...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

ژورنال

عنوان ژورنال: Speech Communication

سال: 2022

ISSN: ['1872-7182', '0167-6393']

DOI: https://doi.org/10.1016/j.specom.2022.08.004